89 research outputs found

    Next generation sequencing analysis reveals a relationship between rDNA unit diversity and locus number in Nicotiana diploids

    © 2012 Matyášek et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited

    Genome-wide repeat dynamics reflect phylogenetic distance in closely related allotetraploid Nicotiana (Solanaceae)

    Nicotiana sect. Repandae is a group of four allotetraploid species originating from a single allopolyploidisation event approximately 5 million years ago. Previous phylogenetic analyses support the hypothesis of N. nudicaulis as sister to the other three species. This is concordant with changes in genome size, separating those with genome downsizing (N. nudicaulis) from those with genome upsizing (N. repanda, N. nesophila, N. stocktonii). However, a recent analysis reflecting genome dynamics of different transposable element families reconstructed greater similarity between N. nudicaulis and the Revillagigedo Island taxa (N. nesophila and N. stocktonii), thereby placing N. repanda as sister to the rest of the group. This could reflect a different phylogenetic hypothesis or the unique evolutionary history of these particular elements. Here we re-examine relationships in this group and investigate genome-wide patterns in repetitive DNA, utilising high-throughput sequencing and a genome skimming approach. Repetitive DNA clusters provide support for N. nudicaulis as sister to the rest of the section, with N. repanda sister to the two Revillagigedo Island species. Clade-specific patterns in the occurrence and abundance of particular repeats confirm the original (N. nudicaulis (N. repanda (N. nesophila + N. stocktonii))) hypothesis. Furthermore, overall repeat dynamics in the island species N. nesophila and N. stocktonii confirm their similarity to N. repanda and the distinctive patterns between these three species and N. nudicaulis. Together these results suggest that broad-scale repeat dynamics do in fact reflect evolutionary history and could be predicted based on phylogenetic distance

    Independent Domestication of Two Old World Cotton Species

    Domesticated cotton species provide raw material for the majority of the world's textile industry. Two independent domestication events have been identified in allopolyploid cotton, one in Upland cotton (Gossypium hirsutum L.) and the other in Egyptian cotton (Gossypium barbadense L.). However, two diploid cotton species, Gossypium arboreum L. and Gossypium herbaceum L., have been cultivated for several millennia, but their status as independent domesticates has long been in question. Using genome resequencing data, we estimated the global abundance of various repetitive DNAs. We demonstrate that, despite negligible divergence in genome size, the two domesticated diploid cotton species contain different, but compensatory, repeat content and have thus experienced cryptic alterations in repeat abundance despite equivalence in genome size. Evidence of independent origin is bolstered by estimates of divergence times based on molecular evolutionary analysis of ~7,000 orthologous genes, for which synonymous substitution rates suggest that G. arboreum and G. herbaceum last shared a common ancestor approximately 0.4–2.5 Ma. These data are incompatible with a shared domestication history during the emergence of agriculture and lead to the conclusion that G. arboreum and G. herbaceum were each domesticated independently
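
The abstract above dates the split between the two diploid cottons from synonymous substitution rates. A minimal sketch of the standard molecular-clock relationship, T = Ks / (2r), is given below. It is not the authors' pipeline: the function name and the example values of Ks and r are hypothetical placeholders chosen only to show the arithmetic, and the abstract does not state which rate or per-gene procedure was used.

```python
def divergence_time_ma(ks: float, rate_per_site_per_year: float) -> float:
    """Estimate divergence time in millions of years (Ma) from a
    synonymous distance Ks, assuming a strict molecular clock.

    The factor of 2 accounts for substitutions accumulating along
    both lineages since their common ancestor.
    """
    if ks < 0:
        raise ValueError("Ks must be non-negative")
    if rate_per_site_per_year <= 0:
        raise ValueError("substitution rate must be positive")
    years = ks / (2.0 * rate_per_site_per_year)
    return years / 1e6


if __name__ == "__main__":
    # Hypothetical inputs for illustration only (not values from the paper).
    example_ks = 0.01
    example_rate = 5e-9  # synonymous substitutions per site per year (assumed)
    print(f"{divergence_time_ma(example_ks, example_rate):.2f} Ma")
```

Because the estimate scales inversely with the assumed rate, a range of plausible rates yields a range of dates, which is one way an interval such as the one reported in the abstract can arise.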

    Time-calibrated phylogenetic trees establish a lag between polyploidisation and diversification in Nicotiana (Solanaceae)

    We investigate the timing of diversification in allopolyploids of Nicotiana (Solanaceae) utilising sequence data of maternal and paternal origin to look for evidence of a lag phase during which diploidisation took place. Bayesian relaxed clock phylogenetic methods show recent allopolyploids are a result of several unique polyploidisation events, and older allopolyploid sections have undergone subsequent speciation at the polyploid level (i.e. a number of these polyploid species share a singular origin). The independently formed recent polyploid species in the genus all have mean age estimates below 1 million years ago (Ma). Nicotiana section Polydicliae (two species) evolved 1.5 Ma, N. section Repandae (four species) formed 4 Ma, and N. section Suaveolentes (~35 species) is about 6 million years old. A general trend of higher speciation rates in older polyploids is evident, but diversification dramatically increases at approximately 6 Ma (in section Suaveolentes). Nicotiana sect. Suaveolentes has spectacularly radiated to form 35 species in Australia and some Pacific islands following a lag phase of almost 6 million years. Species have filled new ecological niches and undergone extensive diploidisation (e.g. chromosome fusions bringing the ancestral allotetraploid number, n = 24, down to n = 15 and ribosomal loci numbers back to diploid condition). Considering the progenitors of Suaveolentes inhabit South America, this represents the colonisation of Australia by polyploids that have subsequently undergone a recent radiation into new environments. To our knowledge, this study is the first report of a substantial lag phase being investigated below the family level

    Genomic repeat abundances contain phylogenetic signal

    A large proportion of genomic information, particularly repetitive elements, is usually ignored when researchers are using next-generation sequencing. Here we demonstrate the usefulness of this repetitive fraction in phylogenetic analyses, utilizing comparative graph-based clustering of next-generation sequence reads, which results in abundance estimates of different classes of genomic repeats. Phylogenetic trees are then inferred based on the genome-wide abundance of different repeat types treated as continuously varying characters; such repeats are scattered across chromosomes and in angiosperms can constitute a majority of nuclear genomic DNA. In six diverse examples, five angiosperms and one insect, this method provides generally well-supported relationships at interspecific and intergeneric levels that agree with results from more standard phylogenetic analyses of commonly used markers. We propose that this methodology may prove especially useful in groups where there is little genetic differentiation in standard phylogenetic markers. At the same time as providing data for phylogenetic inference, this method additionally yields a wealth of data for comparative studies of genome evolution

    Independent, Rapid and Targeted Loss of Highly Repetitive DNA in Natural and Synthetic Allopolyploids of Nicotiana tabacum

    Allopolyploidy (interspecific hybridisation and polyploidy) has played a significant role in the evolutionary history of angiosperms and can result in genomic, epigenetic and transcriptomic perturbations. We examine the immediate effects of allopolyploidy on repetitive DNA by comparing the genomes of synthetic and natural Nicotiana tabacum with diploid progenitors N. tomentosiformis (paternal progenitor) and N. sylvestris (maternal progenitor). Using next generation sequencing, a recently developed graph-based repeat identification pipeline, Southern blot and fluorescence in situ hybridisation (FISH), we characterise two highly repetitive DNA sequences (NicCL3 and NicCL7/30). Analysis of two independent high-throughput DNA sequencing datasets indicates NicCL3 forms 1.6–1.9% of the genome in N. tomentosiformis, sequences that occur in multiple, discontinuous tandem arrays scattered over several chromosomes. Abundance estimates, based on sequencing depth, indicate NicCL3 is almost absent in N. sylvestris and has been dramatically reduced in copy number in the allopolyploid N. tabacum. Surprisingly, elimination of NicCL3 is repeated in some synthetic lines of N. tabacum in their fourth generation. The retroelement NicCL7/30, which occurs interspersed with NicCL3, is also under-represented but to a much lesser degree, revealing targeted elimination of NicCL3. Analysis of paired-end sequencing data indicates the tandem component of NicCL3 has been preferentially removed in natural N. tabacum, increasing the proportion of the dispersed component. This occurs across multiple blocks of discontinuous repeats and, based on the distribution of nucleotide similarity among NicCL3 units, was concurrent with rounds of sequence homogenisation

    Next-generation sequencing reveals the impact of repetitive DNA in phylogenetically closely related genomes of Orobanchaceae

    We used next-generation sequencing to characterize the genomes of nine species of Orobanchaceae of known phylogenetic relationships, different life forms, and including a polyploid species. The study species are the autotrophic, nonparasitic Lindenbergia philippensis, the hemiparasitic Schwalbea americana, and seven nonphotosynthetic parasitic species of Orobanche (Orobanche crenata, Orobanche cumana, Orobanche gracilis (tetraploid), and Orobanche pancicii) and Phelipanche (Phelipanche lavandulacea, Phelipanche purpurea, and Phelipanche ramosa). Ty3/Gypsy elements comprise 1.93%–28.34% of the nine genomes and Ty1/Copia elements comprise 8.09%–22.83%. When compared with L. philippensis and S. americana, the nonphotosynthetic species contain higher proportions of repetitive DNA sequences, perhaps reflecting relaxed selection on genome size in parasitic organisms. Among the parasitic species, those in the genus Orobanche have smaller genomes but higher proportions of repetitive DNA than those in Phelipanche, mostly due to a diversification of repeats and an accumulation of Ty3/Gypsy elements. Genome downsizing in the tetraploid O. gracilis probably led to sequence loss across most repeat types

    Linked read technology for assembling large complex and polyploid genomes

    Background: Short read DNA sequencing technologies have revolutionized genome assembly by providing high accuracy and throughput data at low cost. But it remains challenging to assemble short read data, particularly for large, complex and polyploid genomes. The linked read strategy has the potential to enhance the value of short reads for genome assembly because all reads originating from a single long molecule of DNA share a common barcode. However, the majority of studies to date that have employed linked reads were focused on human haplotype phasing and genome assembly. Results: Here we describe a de novo maize B73 genome assembly generated via linked read technology which contains ~172,000 scaffolds with an N50 of 89 kb that cover 50% of the genome. Based on comparisons to the B73 reference genome, 91% of linked read contigs are accurately assembled. Because it was possible to identify errors with >76% accuracy using machine learning, it may be possible to identify and potentially correct systematic errors. Complex polyploids represent one of the last grand challenges in genome assembly. Linked read technology was able to successfully resolve the two subgenomes of the recent allopolyploid, proso millet (Panicum miliaceum). Our assembly covers ~83% of the 1 Gb genome and consists of 30,819 scaffolds with an N50 of 912 kb. Conclusions: Our analysis provides a framework for future de novo genome assemblies using linked reads, and we suggest computational strategies that, if implemented, have the potential to further improve linked read assemblies, particularly for repetitive genomes
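
The abstract above reports scaffold N50 values for both assemblies. For readers unfamiliar with the statistic, the sketch below computes N50 under its usual definition: the length of the shortest scaffold in the smallest set of longest scaffolds that together cover at least half of the total assembly length. The function name and the toy lengths are hypothetical, and the abstract does not say which tool the authors used to compute their figures.

```python
from typing import Iterable


def n50(lengths: Iterable[int]) -> int:
    """Return the N50 of a collection of scaffold (or contig) lengths."""
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("at least one length is required")
    total = sum(ordered)
    running = 0
    for length in ordered:
        running += length
        # Stop at the first scaffold that brings the cumulative
        # length to at least half of the assembly.
        if 2 * running >= total:
            return length
    raise AssertionError("unreachable for non-empty input")


if __name__ == "__main__":
    # Toy scaffold lengths in base pairs (hypothetical, not from the paper).
    toy_lengths = [2, 3, 4, 5, 6, 7, 8, 9, 10]
    print(n50(toy_lengths))
```

Note that this variant is computed relative to the assembly's own total length; when half of an estimated genome size is used as the threshold instead, the statistic is usually called NG50.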